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A note on a maximal Bernstein inequality 
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We show somewhat unexpectedly that whenever a general Bernstein-type maximal inequality 
holds for partial sums of a sequence of random variables, a maximal form of the inequality is 
also valid. 
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1. Introduction and statement of main result 

Let Xi,X2, . . . , be a sequence of independent random variables such that for all i > 1, 
EXi = and for some k > and v>Q for integers m > 2, E\Xi\"^ < uto!k™~^/2. The 
classic Bernstein inequality (cf. [13], page 855) says that, in this situation, for all n > 1 
and t > 0, 






>t}<2 exp 



t^ 



2vn + 2fti 



Moreover (cf. [12], Theorem B.2), its maximal form also holds; that is, we have 



% 



P < max 

l<J<n 



4 = 1 



>t}<2 exp 



2vn + 2Kt 



It turns out that, under a variety of assumptions, a sequence of not necessarily indepen- 
dent random variables Ai, A2, . . . , will satisfy a generalized Bernstein- type inequality of 
the following form: For suitable constants A>0, a>0, &>0 and < 7 < 2 for all m > 0, 
n > 1 and t > 0, 



P{|S'(m-|-l,m-f 7i)| >i}< Aexp<^ - 



ai^ 



n + bti 



(1.1) 
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where, for any choice oi 1 < k < I < oo, we denote the partial sum S{k,l) = J2i=k-^i- 
Here are some examples. 

Example 1 . Let Xi , X2 , . . . , be a stationary sequence satisfying EXi = and 
VarXi = 1. For each integer n > 1 set 5'„ = Xi + • • • + X„ and B^ = Var(5„). Assume 
that for some CTq > we have B^ > a^n for all n > 1. Statulevicius and Jakimavicius [15] 
and Saulis and Statulevicius [14] prove that the partial sums satisfy (1.1) with constants 
depending on a Bernstein condition on the moments of Xi and the particular mixing con- 
dition that the sequence may fulfill. In fact, all values of 1 < 7 < 2 are attainable. Their 
Bernstein-type inequalities are derived via a result of [1] relating cumulant behavior to 
tail behavior, which says that for an arbitrary random variable ^ with expectation 0, 
whenever there exist 7 > 0, H > and A > such that its cumulants rfc(<^) satisfy 
\^k{0\ < (fc!/2)i+''i?/A'=-2 for fc = 2,3,..., then for allx>0 



P{±C > 4 < cxp| 2(H + (x/Ai/(i+27))(i+27)/(i+7)) /■ (^-^^ 

In Example 1, ^f = Sn/Bn and A = d^/n for some d > 0. 

Example 2. Doukhan and Neumann [4] have shown, using the result in (1.2), that 
if a sequence of mean zero random variables Xi,X2,..., satisfies a general covariance 
condition, then the partial sums satisfy (1.1). Refer to their Theorem 1 and Remark 2, 
and also see [8]. 

Example 3. Assume that Xi , X2 , • . • , is a strong mixing sequence with mixing coef- 
ficients a(7i), n > 1, satisfying for some c > 0, a{n) < exp(— 2cn). Also assume that 
EXi = and for some M > for aU i > 1, \Xi\<M. Theorem 2 of Merlcvcde, Peligrad 
and Rio [9] implies that for some constant C > for alH > and n > 1 , 

P{|5'„|>i}<expf ^^^^* ^^,^ -], (1.3) 

with Sn = YTt=\ ^t and where v"^ ^ sup,,>o(Var(Xi) + 2 J2j>i I cov{Xi,Xj)\) > 0. 

To see how the last example satisfies (1.1), notice that for any < 77 < 1 there exists a 
Di > such that for alH > and n > 1, 



nv 



+ M^ + tM(lognf < n{v^ + M^) + Dit^+'^ . (1.4) 



Thus (1.1) holds with 7 = 1 + r/ for suitable A > 0, a > and fe > 0. 
For any choice of 1 < i < j < 00 define 

M(z,j) = max{|5(z,z)|,...,|5(z,j)|}. 
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We shall show, somewhat unexpectedly, that if a sequence of random variables Xi, X2, . . . , 
satisfies a Bernstein- type inequality of the form (1.1), then, without any additional as- 
sumptions, a modified version of it also holds for M(l + m,n + m) for all m > and 
n> 1. 

Theorem 1. Assume that, for constants A> 0, a> 0, b>0 and 7 e (0,2), inequal- 
ity (1.1) holds for all ?n > 0,n > 1 and t > 0. Then for every < c < a there exists 
a C > depending only on A, a, b and 7 such that for all n>l, m > and t>0, 

P{M{m+l,m + n)>t}<Ccxpl ^^V^l- (1-5) 

Remark 1. Notice that though c < a, c can be chosen arbitrarily close to a. 

Remark 2. Theorem 1 was motivated by Theorem 2.2 of Moricz, Serfiing and Stout [11], 
who showed that whenever for a suitable positive function g{i,j) of {i,j) G {1,2, . . .} x 
{1,2, . . .}, positive function (j){t) defined on (0,c») and constant K > 0, for all 1 < i < 
j < 00 and i > 0, 

P{|5(*,j)| > < Kexp{-^{t)/g{i,j)}, 

then there exist constants < c < 1 and C > such that for all n > 1 and i > 0, 

P{M(l,7i) >t}< Ccxp{-c<Pit)/g{l,n)}. 

Earlier, Moricz [10] proved that in the special case when (j){t) = t^ one can choose c < 1 
arbitrarily close to 1 by making C > large enough. This inequality is clearly not appli- 
cable to obtain a maximal form of the generalized Bernstein inequality. 

Remark 3. We do not know whether there exist examples for which (1.1) holds for some 
< 7 < 1 and 6 > 0. However, since the proof of our theorem remains valid in this case, 
we shall keep it in the statement. 

Remark 4- The version of Theorem 1 obtained by replacing everywhere \S{m + 
l,m + n)\ by S{m + l,m + n) and M{m + l,m + n) by M^{m + l,m + n) = 
maxm+i<j<„_|_„i(5(r7i + 1, j) V 0) remains true with little change in the proof. 

Remark 5. Theorem 1 also remains valid for sums of Banach space valued random 
variables with absolute value | • | replaced by norm 1 1 • 1 1 . 

Remark 6. In statistics, maximal exponential inequalities are crucial tools to determine 
the exact rate of almost sure pointwise and uniform consistency of kernel estimators of 
the density function and the regression function. The literature in this area is huge. See, 
for instance, [2, 3, 5-7, 16] and the references therein. These results only treat the case 
of i.i.d. observations. Dependent versions of our maximal Bernstein inequalities should 
be useful to determine exact rates of almost sure consistency of kernel estimators based 
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on data that possess a certain dependence structure. In fact, some work in this direction 
has aheady been accompUshed in Section 4.2 of [4]. To carry out such an appUcation in 
the present paper is well beyond its scope. 

Theorem 1 leads to the following bounded law of the iterated logarithm. 

Corollary 1. Under the assumptions of Theorem 1, with probability 1, 

limsup ' ; ' ^' < -^. (1.6) 

Remark 7. In general, one cannot replace "<" by "=" in (1.6). To see this, let Y, 
Zi,Z2,.-. be a sequence of independent random variables such that Y takes on the 
value or 1 with probability 1/2 and Zi, Z2, ■ ■ ■ are independent standard normals. Now 
define Xi = YZi, i = l,2,. . .. It is easily checked that assumption (1.1) is satisfied with 
A = 2, a = l/2, 6 = and 7 = 1. When Y = 1 the usual law of the iterated logarithm 
gives with probability 1, 

limsup |S'(l,n)|/\/"loglogn = \/2 = l/Va, (1.7) 

whereas, when Y = the above limsup is obviously 0. This agrees with Corollary 1, which 
says that with probability 1 the limsup is <v2. However, we see that with probability 
1/2 it equals V2 and with probability 1/2 it equals 0. 

Theorem 1 is proved in Section 2 and the proof of Corollary 1 is given in Section 3. 

2. Proof of theorem 

The case & = is a special case of Theorem 1 of [10]. Therefore we shall always assume 
that b> 0. Choose any < c < a. We prove our theorem by induction on n. Notice that 
by the assumption, for any integer rig > 1 we may choose C > Auq to make the statement 
true for all 1 < n < no. This remark will be important, because at some steps of the proof 
we assume that n is large enough. Also, since the constants A, a, b and 7 in (1.1) are 
independent of ?Ti, we can assume m = without loss of generality in our proof. 

Assume the statement holds up to some n > 2. (The constant C will be determined in 
the course of the proof.) 

Case 1 : Fix a < > for which 

i'^<an (2.1) 

for some < a < 1 to be specified later. (In any case, we assume that an > 1.) Using an 
idea of [11], we may write for arbitrary l<fc<n, 0<g<l and p + q=l the inequality 

P{7\/(l,71+l)>t} 

< P{M(1, k)>t} + P{|S'(1, fc)| > pt} + P{M{k + l,n + 1) > qt}. 
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Let 

n + rb(q''-q^) 

u = . 

l + q^ 

Notice that 



qH'^ 



u + hti n-u + bq'^t^ 
Set 



(2.2) 



k^\u\. (2.3) 

Using the induction hypothesis and (1.1) we obtain 

P{A/(l,n+l)>t} 

(2.4) 



Notice that we chose k to make the first and third terms in the right-hand side of (2.4) 
ahuost equal, and since by (2.3) 

e qH^ 



k + bti ~ n-k + bqtti ' 

the first term is greater than or equal to the third. 

First we handle the second term in (2.4), showing that for < f < (an)^/''', 

f apH"^ \ ( cf 

exp< — > < cxp< 



k + bp'yfy } ~ [ n + i + bfy 

For this we need to verify that for < ^ < (cm)^/'*', 



k + bp-yfy n + 1 + bt-y 
which is equivalent to 

ap'^in + 1 + bt^) > c{k + bp^f) 
Using that 



""^ >:-^;tt:.. P.5) 



fc = [m] < u + 1 = 1 + -[n + b{q'' - q^)t^] 

l + q-^ 



it is enough to show 



c 



cb 



n[ap^- J J j +t''iap^b - cbp"' - ^ ^(9^ -q^)]+ap'-c> 0. 



A note on a maximal Bernstein inequality 1059 

Note that if the coefficient of n is positive, then we can choose a in (2.1) smah enough 
to make the above inequahty hold, even if the coefficient of f^ is negative. So in order 
to guarantee (2.5) (at least for large n) we only have to choose the parameter p so that 
ap^ — c > - which implies that 

a/ - -^ > (2.6) 

holds - and then select a small enough. 

Next we treat the first and third terms in (2.4). By the remark above, it is enough 
to handle the first term. Let us examine the ratio of Ccxp{— ci^/(fc + bf)} and 
Cexp{— ci^/(n + 1 + bf^)}. Notice again that since u + l> k, 

n + b{q'^~q^)t^ 

n + 1 — k>n — u = n 

1 + q'' 

_ q^n-b{q'' ~q'^)t'< 

" 1+^2 

> n ; 

l + q^ 

= : ciTi. 
At this point we need that < ci < 1. Thus wc choose a small enough so that 

g2-a&(qT-g2)>o. (2.7) 

Also, using t < {anY^'* , we get the bound 

(71 + 1 + bt'^){k + bt^) < 71^(1 + abf =: C2r^^ 
which holds if n is large enough. Therefore, we obtain for the ratio 
2/^1 1 M^ f ccit^ 



^^P^ -^^ I A^T^F^ - ^^TTT6^ j I - °n"^ ^ - '" 



whenever ccit^ /{c2n) > 1, that is, t > y/c2n/{cci). Substituting back into (2.4), for t > 
\/c2n/{cci) and t < (an)^''^ we obtain 

P{A/(l,71+l)>t} 

< (-C + a) exp{-ctV('^ + 1 + bf)} < Ccxp{-ct'^/{n + 1 + bt^)}, 

where the last inequality holds for C > Ac/{e — 2). 

Next assume that t < ■\/c2n/(cci). In this case, choosing C large enough, we can make 
the bound > 1, namely 

Cexpj-— ^ j > Ccxpj-^ j = Ce-/- > 1, 
[ n + l + bt^) [ cciTiJ 
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if C>e'=2/'=i. 

Case 2: Now we must handle the case t > (an)^''^ . Here we apply the inequality 

P{M(l,n+l)>f}<P{A/(l,n)>t}+P{|5(l,n + l)| >t}. 

Using assumption (1.1) and the induction hypothesis, we have 

P{M(l,n + l)>t}<Ccxp<^ ^^+^cxp<^ 



n + bV j [ n + l + bf^ 

We will show that the right-hand side < Cexp{—ct^/{n + 1 + bt"^)}. For this it is enough 
to prove 



n + btt n + l + W J ] C { n + l + bti^ 
First assume that 7 < 1. Using the bound following from t > (an)^/''', we get 



{n + bt^){n + bfi + 1) ~ (a^^ + b){2a-'^ + 6)^27 
We have that the right-hand side of (2.8) for a > c is less than 

for C large enough. 

For 1 < 7 < 2 we have to use a different argument. For t large enough (i.e., for n large 
enough, since t > (an)^'^) we have 

f Ct'^ 1 r 9 2-.T cc3i2-27 

We also have for C > A, 



A ( t'^(a — c): 1 , ^ u- 
■exp<^ ^ -^ )■ <cxpi -f-^ - 



C { n+l + br'j~ { 2a-i+6 

It is clear that since a> c, for t large enough, that is, for n large enough, 

cc3i^-27 r . a-c 
> exp< — t '- — 

The proof is complete. 
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3. Proof of corollary 

Choose any A > 1 and set nir ~ \\^~\ for r =1,2, Now, using inequality (1.5), we get 



P{M{l,mr) > \/ c imr+iloglogmr} 
mr+iloglog?7ir 



< Cexp 



mr + h(c ^TTlr+l loglogTOr)'''/^ 

Since as r — > oo 

m,.+iloglog?7i,. 

_ — -y^ = (l + o(l))Alogr, 

vdr + b[c m^+i loglogrrirj^' 

it is readily checked that for tq large enough so that log log 771,^0 > 0, 



y^ P{Af (IjTOr) > \/ C ^TTlr+l loglogTTlr} < OO 

and thus, since TOr+i/wr = A + o(l), we get by the Borel-Cantelli lemma that with 
probability 1 

hm sup < V Ac ^. (3.1) 

r^oo V ™r log log m^ 



Next we see that for all r > rg 



\S{\,n)\ ^ M{l,mr+i) 



mr<n<mr+i y/u log log n y^rn,. log log mr 
Thus by (3.1), with probability 1, 

1^(1, n)| 



lim sup max 
< lim sup 



tx3 m^<n<mr+i -^nloglogn 

M(l,m^+i) 



\/?71r log log TTtr 



M(l,mr+l) ^77V+l log log 771^+1 , 

= lim Slip - -I ^ < Av C^^. 

r^oo ^777^+1 log log 777r+l V "V fog fog ?T7r 

Hence, since A > 1 can be chosen arbitrarily close to 1 and c < a arbitrarily close to a, 
we have proved (1.6). 
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